Automated Hierarchical Patent Classification (Bachelor’s Thesis Proposal)
نویسنده
چکیده
The number of patent applications is growing steadily over the years. In the year 2014, the U.S. Patent and Trademark Office registered 615,243 filed patent applications. In the year 2000, the number was only as high as 315,015 [1]. In order to efficiently process and search those applications, patent documents are annotated with the different areas of technology to which they belong. This process of annotating the documents can be automated with software systems. Given a patent document and a patent classification scheme, such systems automatically produce suggestions for annotations [2]. Typically, this is achieved by using a machine learning-based classification algorithm trained on a set of annotated patents. The predominantly used annotation scheme is the International Patent Classification (IPC) [3]. The IPC’s structure is hierarchical: It is comprised of a tree of categories with height 13 [4]. For example, a patent could belong to the categories (1) physical analysis of biological material, (2) physical analysis of liquid biological material, and (3) physical analysis of blood [3]. Here, (2) is a subcategory of (1) and (3) is a subcategory of (2). Other patent classification have a similar hierarchical layout [5]. This hierarchical structure poses an interesting challenge for automated patent classification. As opposed to many other classification problems, a classifier would have to take a hierarchy of target classes into account.
منابع مشابه
A Signature Approach to Patent Classification
We propose a document signature approach to patent classification. Automatic patent classification is a challenging task because of the fast growing number of patent applications filed every year and the complexity, size and nested hierarchical structure of patent taxonomies. In our proposal, the classification of a target patent is achieved through a k-nearest neighbour search using Hamming di...
متن کاملAn Intelligent System for Automated Binary Knowledge Document Classification and Content Analysis
Many companies rely on patent engineers to search patent documents and offer recommendations and advice to R&D engineers. Given the increasing number of patent documents filed each year, new means to effectively and efficiently identify and manage technology specific patent documents are required. This research applies a back-propagation artificial neural network (BPANN), a hierarchical ontolog...
متن کاملAutomated Patent Categorization and Guided Patent Search using IPC as Inspired by MeSH and PubMed
Document search on PubMed, the pre-eminent database for biomedical literature, relies on the annotation of its documents with relevant terms from the Medical Subject Headings ontology (MeSH) for improving recall through query expansion. Patent documents are another important information source, though they are considerably less accessible. One option to expand patent search beyond pure keywords...
متن کاملMental Arithmetic Task Recognition Using Effective Connectivity and Hierarchical Feature Selection From EEG Signals
Introduction: Mental arithmetic analysis based on Electroencephalogram (EEG) signal for monitoring the state of the user’s brain functioning can be helpful for understanding some psychological disorders such as attention deficit hyperactivity disorder, autism spectrum disorder, or dyscalculia where the difficulty in learning or understanding the arithmetic exists. Most mental arithmetic recogni...
متن کاملUsing the Multi-level Classification Method in the Patent Mining Task at NTCIR-7
A patent includes a great deal of practical technical information, and plays an important role in promoting scientific development. The research on patent classification and retrieval has significant application value. A patent is a special technical text with strict hierarchical classification system and normalized structure, and there are a number of relations between patents and their consti...
متن کامل